PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_1951_f_2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 759aa    MW: 82925 Da    PI: 6.0569
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_1951_f_2genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox63.53.2e-2091146156
                    TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
       Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                    +++ +++t++q++e+e++F+++++p+ ++r+eL+++lgL+  qVk+WFqN+R+++k
  Neem_1951_f_2  91 KKRYHRHTQHQIQEMETFFKECPHPDDKQRKELSRELGLEPLQVKFWFQNKRTQMK 146
                    688999***********************************************999 PP

2START2244.5e-702805001206
                    HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
          START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                    ela +a++el+++a+ ++p+W++      + +n++e++++f+++ +     ++ ea+r+s+vv+m++  lve+l+d++ qW++ +     +a+tle
  Neem_1951_f_2 280 ELAVAAMEELFRMAQMGQPLWMTGLdgttSVLNEEEYVRTFPRGIGpkptgFKCEASRESAVVIMNHISLVEILMDVN-QWSTVFSgivsRAMTLE 374
                    57899********************999999************999********************************.***************** PP

                    EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE- CS
          START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdl 172
                    v+s+g      galq+m++e+q++splvp R++++vRy++q+g+g+w++vdvS+d+ ++ p     vR++++pSg+li++++ng+skvtwvehv++
  Neem_1951_f_2 375 VLSTGvagnynGALQVMTSEFQVPSPLVPtRESYYVRYCKQHGEGTWAVVDVSLDNLRPSPA----VRCRRRPSGCLIQEMPNGYSKVTWVEHVEV 466
                    ***********************************************************995....****************************** PP

                    -SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
          START 173 kgrlphwllrslvksglaegaktwvatlqrqcek 206
                    ++r +h+l+++lv+sg+a+gak+wvatl+rqce+
  Neem_1951_f_2 467 DDRGVHNLYKQLVSSGHAFGAKRWVATLDRQCER 500
                    ********************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.4E-2270146IPR009057Homeodomain-like
SuperFamilySSF466892.55E-1979148IPR009057Homeodomain-like
PROSITE profilePS5007116.7388148IPR001356Homeobox domain
SMARTSM003891.8E-1989152IPR001356Homeobox domain
CDDcd000868.21E-1991149No hitNo description
PfamPF000467.9E-1891146IPR001356Homeobox domain
PROSITE patternPS000270123146IPR017970Homeobox, conserved site
SuperFamilySSF559612.3E-36271502No hitNo description
PROSITE profilePS5084844.549271503IPR002913START domain
CDDcd088756.09E-129275499No hitNo description
SMARTSM002348.6E-67280500IPR002913START domain
PfamPF018521.8E-59281500IPR002913START domain
Gene3DG3DSA:3.30.530.208.0E-6379482IPR023393START-like domain
SuperFamilySSF559614.31E-24521751No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010090Biological Processtrichome morphogenesis
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 759 aa     Download sequence    Send to blast
MPTGVMIPAR NMPSVIGRNG NVGGLGSSSG GLSLSQPNMM EGQLHPLDMT QNTSESEIAR  60
LREEEFDSTK SGSENHEGAS GDDQEQQPPK KKRYHRHTQH QIQEMETFFK ECPHPDDKQR  120
KELSRELGLE PLQVKFWFQN KRTQMKTQHE RHENTQLRAE NEKLRADNMR YREALSNASC  180
PNCGGPTAIG EMSFDEHHLR LENARLREEI DRISAIAAKY VGKPVVNYPL LSPPVPPRPL  240
ELGVGNFGGQ PGMGGEMYGA ADLLRSINTP TEADKPMIIE LAVAAMEELF RMAQMGQPLW  300
MTGLDGTTSV LNEEEYVRTF PRGIGPKPTG FKCEASRESA VVIMNHISLV EILMDVNQWS  360
TVFSGIVSRA MTLEVLSTGV AGNYNGALQV MTSEFQVPSP LVPTRESYYV RYCKQHGEGT  420
WAVVDVSLDN LRPSPAVRCR RRPSGCLIQE MPNGYSKVTW VEHVEVDDRG VHNLYKQLVS  480
SGHAFGAKRW VATLDRQCER LASVMATNIP TGEVGVITSQ DGRKSMLKLA ERMVISFCAG  540
VSASTAHTWT TLSGTGADDV RVMTRKSVDD PGRPPGIVLS AATSFWLPVP PKRVFDFLRD  600
ENTRSEWDIL SNGGVVQEMA HIANGRDTGN CVSLLRVNSA NSSQSNMLIL QESCTDATAS  660
FVIYAPVDIV AMNVVLNGGD PDYVALLPSG FAILPDGTSL HGGNIGETAS GGSLLTVAFQ  720
ILVDSVPTAK LSLGSVATVN NLIACTVERI KASLSCENA
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015383008.10.0PREDICTED: homeobox-leucine zipper protein HDG2 isoform X2
RefseqXP_006447539.10.0hypothetical protein CICLE_v10014373mg
RefseqXP_006447538.10.0hypothetical protein CICLE_v10014373mg
RefseqXP_006447537.10.0hypothetical protein CICLE_v10014373mg
SwissprotQ94C370.0HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBLV4W4610.0V4W461_9R
STRINGVIT_12s0059g02310.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G05230.40.0homeodomain GLABROUS 2